Implementation of Sequential Loops in Data
نویسنده
چکیده
The implementation of sequential loops in dataaow computation had traditionally not received very much attention as it was assumed that most loops would be executed in parallel. This assumption was valid for earlier dataaow machines such as the MIT Tagged Token Dataaow Architecture (TTDA))2], Sigma-119] but not for the newest generation of dataaow machines including Monsoonn6], EM-4411] and Epsilon-227]. On the latter machines, sequential loops use less memory, and can execute in fewer instructions, albeit with lower parallelism then the parallel versions. This char-acterisation of sequential and parallel loops suggests that programs should have parallel outer loops and sequential inner loops. The run time of sequential loops therefore become signiicant in the overall run time. We also found that previous implementations of sequential loops can incur fairly high overheads. In this paper, we present two new ways of implementing sequential loops that have lower overhead then previous methods. We studied this problem in the context of compiling Idd14, 15] for Monsoon. 1 Introduction Loops executing on dataaow machines can be classiied into three categories according to the amount of inter-iteration parallelism allowed. At one extreme is an unbounded loop, where there is no artiicial bound on the number of concurrent iterations. Such a loop is compiled into a recursive procedure which together with non-strict procedure call convention lead to uncontrolled unraveling of the loop. Next, we have what is often referred to as k-bounded loopss5], where the number of iterations executing concurrently is bounded
منابع مشابه
Eecient Implementation of Sequential Loops in Dataaow Computation Boon Seong Ang Eecient Implementation of Sequential Loops in Dataaow Computation
The implementation of sequential loops in dataaow computation had traditionally not received very much attention as it was assumed that most loops would be executed in parallel. This assumption was valid for earlier dataaow machines such as the MIT Tagged Token Dataaow Architecture (TTDA))2], Sigma-119] but not for the newest generation of dataaow machines including Monsoonn6], EM-4411] and Eps...
متن کاملThe E cient Implementation of Sequential Loops inMultithreaded
In multithreaded computers, the per-iteration cycle cost is largely variable according to loop implementation schemes. Especially, when sequential loops with a lot of loop carried dependences in their bodies are unfolded, a number of value movements are required between frames, and a lot of synchronizations must be performed between threads, causing considerable overhead. However, it has been o...
متن کاملDesign and Implementation of a High Speed Systolic Serial Multiplier and Squarer for Long Unsigned Integer Using VHDL
A systolic serial multiplier for unsigned numbers is presented which operates without zero words inserted between successive data words, outputs the full product and has only one clock cycle latency. 
The multiplier is based on a modified serial/parallel scheme with two adjacent multiplier cells. Systolic concept is a well-known means of intensive computational task through replication of fu...
متن کاملDesign and Test of New Robust QCA Sequential Circuits
One of the several promising new technologies for computing at nano-scale is quantum-dot cellular automata (QCA). In this paper, new designs for different QCA sequential circuits are presented. Using an efficient QCA D flip-flop (DFF) architecture, a 5-bit counter, a novel single edge generator (SEG) and a divide-by-2 counter are implemented. Also, some types of oscillators, a new edge-t...
متن کاملDesign and Implementation of a High Speed Systolic Serial Multiplier and Squarer for Long Unsigned Integer Using VHDL
A systolic serial multiplier for unsigned numbers is presented which operates without zero words inserted between successive data words, outputs the full product and has only one clock cycle latency. The multiplier is based on a modified serial/parallel scheme with two adjacent multiplier cells. Systolic concept is a well-known means of intensive computational task through replication of func...
متن کامل